SEQUENCE LISTING 

<110> WALLAART, Thorvald Eelco 
BOUWMEESTER, Hendrik Jan 

<120> Transgenic Amorpha-4, 11-Diene Synthesis 

<130> 702 010272 

<140> 09/763,822 
<141> 2001-02-26 

<150> PCT/EP99/06302 
<151> 1999-08-27 

<160> 14 

<170> MS Word 97 SR-2 

<210> 1 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> EcoR I (Not I) adapter 
<400> 1 

gtcgacgcgg ccgcg 

<210> 2 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> EcoR I (Not I) adapter 
<400> 2 

cagctgcgcc ggcgcttaa 

<210> 3 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer C) used in PCR amplification 
<400> 3 

gtcgacaaac catggcactt acagaag 

<210> 4 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense primer (primer D) used in PCR amplification 



<400> 4 



r 



ggatggatcc tcatatactc ataggataaa eg 32 

<210> 5 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer G) used in PCR amplification 
<400> 5 

gaggatccat gtcacttaca gaa 23 

<210> 6 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense primer (primer H) used in PCR amplification 
<400> 6 

atggatcctc atatactcat agga 24 

<210> 7 
<2\\> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Sense primer (primer E) used in PCR amplification 
<400> 7 

cgagaattca tgtcacttac ag 22 

<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Antisense primer (primer F) used in PCR amplification 
<400> 8 

ggatctcgag tcatatactc at 22 

<210> 9 
<211> 538 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleotide sequence of probe generated by PCR with primers A 
and B 

<400> 9 

gatgagaatg ggaaatttaa ggaatcgtta gctaatgatg ttgaaggttt gcttgagttg 60 
tacgaagcaa cttctatgag ggtacctggg gagattatat tagaagatgc tcttggtttt 120 



acacgatctc gtcttagcat tatgacaaaa gatgettttt ctacaaaccc cgctcttttt 180 



accgaaatac 


aacgggcact 


aaagcaaccc 


ctttggaaaa 


ggttgccaag 


aatagaggcg 


240 


gcgcagtaca 


ttcctttcta 


tcaacaacaa 


gattctcata 


acaagacttt 


acttaaactt 


300 


gctaagttag 


agttcaattt 


gcttcagtca 


ttgcacaagg 


aagagctcag 


ccatgtgtgc 


360 


aaatggtgga 


aagcttt cga 


tatcaagaag 


aacgcacctt 


gtttaagaga 


tagaattgtt 


420 


gaatgctact 


tttggggact 


aggttcaggc 


tatgagccac 


agtattcccg 


ggctagagtt 


480 


ttcttcacaa 


aagctgttgc 


tgttataact 


cttatagacg 


acaccttcga 


cgctacgg 


538 



<210> 10 
<211> 179 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Deduced amino 
primers A and 

<400> 10 

Asp Glu Asn Gly Lys 
1 5 
Leu Leu Glu Leu Tyr 
20 

lie Leu Glu Asp Ala 
35 

Thr Lys Asp Ala Phe 
50 

Arg Ala Leu Lys Gin 
65 

Ala Gin Tyr lie Pro 
85 

Leu Leu Lys Leu Ala 
100 

Lys Glu Glu Leu Ser 
115 

Lys Lys Asn Ala Pro 
130 

Trp Gly Leu Gly Ser 
145 

Phe Phe Thr Lys Ala 
165 

Asp Ala Thr 

<210> 11 
<211> 2112 
<212> DNA 

<213> Artemisia ann 
<220> 

<223> Nucleotide sequence of a positive clone (amorphadiene synthase 

encoding gene) isolated from the cDNA library of induced A. annua 

<400> 11 

aattcgcggc cgcgtcgaca aatcatgtca cttacagaag aaaaacctat tcgccccatt 60 
gccaactttc ctccaagcat ttggggagat cagtttctca tctatcaaaa gcaagtagag 120 



acid sequence of probe generated by PCR with 
B 



Phe 


Lys 


Glu 


Ser 


Leu 


Ala 


Asn 


Asp 


Val 


Glu 


Gly 










10 










15 




Glu 


Ala 


Thr 


Ser 


Met 


Arg 


Val 


Pro 


Gly 


Glu 


He 








25 










30 






Leu 


Gly 


Phe 


Thr 


Arg 


Ser 


Arg 


Leu 


Ser 


He 


Met 






40 










45 








Ser 


Thr 


Asn 


Pro 


Ala 


Leu 


Phe 


Thr 


Glu 


He 


Gin 




55 










60 










Pro 


Leu 


Trp 


Lys 


Arg 


Leu 


Pro 


Arg 


He 


Glu 


Ala 


70 










75 










80 


Phe 


Tyr 


Gin 


Gin 


Gin 


Asp 


Ser 


His 


Asn 


Lys 


Thr 










90 










95 




Lys 


Leu 


Glu 


Phe 


Asn 


Leu 


Leu 


Gin 


Ser 


Leu 


His 








105 










110 






His 


Val 


Cys 


Lys 


Trp 


Trp 


Lys 


Ala 


Phe 


Asp 


He 






120 










125 








Cys 


Leu 


Arg 


Asp 


Arg 


He 


Val 


Glu 


Cys 


Tyr 


Phe 




135 










140 










Gly 


Tyr 


Glu 


Pro 


Gin 


Tyr 


Ser 


Arg 


Ala 


Arg 


Val 


150 










155 










160 


Val 


Ala 


Val 


He 


Thr 


Leu 


He 


Asp 


Asp 


Thr 


Phe 



170 175 



oauuy yu i — v-j ^- J 


a. a ^ a ^-4 a. i— ca ^ c 


gaatgattta 


aaaaaagaag 


tgeggcaac t 


act aaaagaa 


180 




L l v_- l_ a y a a 


a v — ■ a i~ ^ — ■ v — ■ ci a i— 


\^ l~ V— VJ ci a ^-4 w 


tgattgatga 


aattcaaege 


240 


L. l Lyy ad Lao 


Pn't - at~papt"'t~ 
L.y La.LL.aL-L L 


LyaaLyy yay 


a1~1 - rTat"r^a"t~n 
aLLyaLL-aLy 


rattriraat a 

\— ' Q V_ L_ V^r CL d \— ^ 


tatttatgaa 


300 


aP'a'^a'^nnl'n 

a l. d Let l y y l y 


a LauL L.y yaa 


i_ y y LyauLy ^ 


L v — i_ v_ L. i_.ai_ 


prnt""} - PPTlt" r"t* 


tatacoaaaci 

k_ CL C y ^ — ' CL CL CL ^ 


360 


taayya L a l L 


a t" nt" t~ a P a t~ <T 
a Ly L LaLaLy 


Lya.LyLL lll. 


eta Laav_,LaLa 


ci o. y cl ci a cz ci ci 


f- ppa rippf t" P 

^ CL ' V_ V- ^ — . 


420 


a afrnAsf pat" 


l. a. ^-4 w l_ ci a c o. 


> v v-j ci a u v-^ \— 


ttnrt taaat 

V_ L. ^-A a ^-j 


tataraaaac 

^ \^ U CL v-^ 


aacttctatg 


480 




nnns na t"l" ^ t" 
yyyayaLLaL 


a L_ LCLUQQUu I— 


not" pftrrat" t" 
ywL.wLi.yywL 


f t 3 pp ppis t r 


f potrtfaar 

c> u C' ' w \ cl \^ ^ 


540 


a v_- U- a L. y a. a a 


A^rr^"hrrr , t"'l _ t" 

a a ^-J a v_ v_* l l u 


\— i_ ^ — - cl * a a a n — • 


ppprrptptfi" 


ttacrcaaat 

V-^ CL V*^ CL CL C- 


acaaegggea 


600 


L- Laafly oaau 


llll l Lyyaa 


asnn'h't'npPA 
aayy l i—yov^a 


a pia at" a rr a cr n 
a y a a i.ay ay y 


PPPPPTPPPlt" Pi 


ratlrctttr 


660 


L a LUdaLclaL. 


aarrai~'l~p"t~P'a 
aaya l. l l a 


1" ^ 3 ^ na 

i_. a a v . a a y a l. 


ttapft^aap 

( v cl o i_ d a a w 


ttpfptaarrt t 

\— i— a a C- 


aaaattcaat 

a ^-A ci y C C *w CL CL c. 


720 


l-'t-ppi-i-pap - )- 


cl c. y \_» a * — ci a 


yyaayayo <— \w 


an^pat-n4-(-T-|- 
a w w w oi u ^ w o 


w ci ci ci c ^ y ^- w 


gaaagctttc 


780 


f a f a rr^ 

Cl I a. CL a \Jt a 


a v4 a a w ^ a * v— * 


ftatttaaaa 


aataaaatta 


ttgaatgeta 


c ttt tgggga 


840 


p1~app/i - 'l - p;^p/ 
l. Lay y l l >— • a y 


apt - 3"t" as p;pp 
y v- i— a l y a y ^ - 


ap^nf^i"!" pp 

av^ay ci i_ i— v-^ 


u CL u CL v-J 


ttttcttcac 


aaaagct g t t 


900 




ptpttAtpjna 


l_ V4 a w CL V-' V_ L- CL v_ 


gatgegtatg 


gtacttatga 


agaacttaag 


960 


a L L. L L LaULy 


aay Liy l l y a 


aayy Lyy ll-o. 


ai~t"ar , at _ rip , i" 
a l i_ a a y \ i— 


tan^papapl" 

i_ a y a \^ a a v— - i— 


-f- ppanaatar 

^ ci a a c cl ^— ^ 


1020 


a +* n a a a n /** Pt a 
a Ly aaaLLya 


La LaLada L L 


a 4- t" pat" rrrta t" 
a L LLaLyyaL 


anatrananan 
aOo. LaLdLay 


acta l y y a a y a 


a f f f pf" t PIP3 
aLLLL-LLy^a 


1080 


aayyayyyda 


yaaL,ay a LL-L 


sff taapfrrp 
a l, i_ l a au l y i— 


rrrrrafl a naat 

v_j vj ci a a w a a i_ 


1~ 1" at - aa aaoa 

l l y u u a ci ci w ci 


at t* 1" at* t" aaa 


1140 


a a.PPt"nat~rYrT 

a a l v_< l. y a l y y 


l_ i_yaayv^aaa 


a "f~ rroTrr^^ a a +■ 
a i— y y y a a a l_ 




V_ CL w v^r a CL w ' CL 


tcraacracrcati 

^ Cl ^ wl 


1200 


rr^tTrsnt" i~n 
y a. c v-> v«> a y <— c. y 


faa'l'r'at'i'ap 

^ a. a c a i— ^ a 


1" nci c rjrjt" rich 


aacctgetta 


caacaacttg 


ttatcttggc 


1260 


a t" n a n +" rra 1" 3 

a Lyay Ly a La 


La L LL.aL,aaa 


ana ni" nt p 
dy ay L L» L y L L. 


n a a t" nrrrfpi" n 
ydd tyy y l. Ly 


LOLL-LyL.a.L'L. 


tppt pt*t ttt 

LOL-LL-L LLLL 


1320 


ay a Lao LL-ay 


y l a l a L- l Lyy 


LLyaLyLL La 


aa"t _ rTa"t~r , 1 - r > a 
aa i— y a i_ a 


f- pa P PPP p a a 
l y a *w w ci v cl ci. 


u ^.j ci ^-H a a 


1380 


yaaayaaaaL 


a-t~an'l - i~pa'! _ p 
a Lay L LL-aLL- 


rrarrr i f^'i~"t _ rTa a 
yay lll lyaa 


ay L.L_aLa^,ya 


^nna^'l'rii'aa 

ayyaa c>a i_aa 


t at paataan 

t-.y i_^aa t-yciy 


14 40 


y a. y La uy lll 


aaaprft" n a t~ 
aaaLL L l y a. L 


L LaOdayyaa 


o1~arfaarTa't _ rr 
y Layaayaoy 


trrtrTfiflaana 

v_y cyyaaaya 


fataaarrna 

i ci l_ cl ci a w w ci 


1500 


n a n t~ a /"» p "i - pa 
y ay LaOL-LL-a 


Luu ^ laadcla 


L>a l LLLaayy 


p /~< /-T 1~ "i - a tp tp pra 
L-L-y l La l Lya 


f- /-frfpt p/t" pa t~ 
LyyLLy tydu 


ptaf t" t" ptp PP 
» c cl c- c y cy^< 


1560 


p , ant~ H t~"l - p''l - 'l _ n 
Lay LLLLLiy 


a ^ ni" f pp afa 
acty l l l a a La 


LyLdyy oday 


nafaapf tpa 
y a i_ a a o i_ v_- a 


p3Pn1"3l~(Trfrr 

uu^y Lui»yyy 


aoarnaatar 

cl u ci ^ ^ a a c a. v • 


1620 


aaaCaLCLCa 


L aady LLlLL 


dL LLy LLLaL 


ppfat" n a n 4~ a 
LLLdtydy La 


La LydLLaLL 


natpptt pnt 
aa LLL L LLy L 


1680 


gCa Lay CC La 


LCaatLatdL 


Lyaddgyy ll 


ddLLdtyLdL 


n't ptpt ■=) 4- nn 
y LL LL-LaLyy 


dydy dd l l ll 


17 4 0 


tcaagctatt 


tggtgtttct 


tgctggcaat 


aataaatcag 


aegcataaaa 


ttgtattgaa 


1800 


etatatgecg 


atagctattt 


aaagttatta 


tacaactaaa 


atattcaaca 


atggtattat 


1860 


acttttactt 


tgtacaaaag 


caaaagtaca 


ctactgttat 


gtaacatttt 


agttctatga 


1920 


tactttagtt 


aegaategge 


ttatatacat 


tgatacactt 


ttatgcagaa 


aaccctagta 


1980 



aataaaaagt cgatatcttg tactacacat atcgcacgaa tttccgtttg ccgtttgtat 2040 

tttacgatat gttatttaat gaatatgttt catgtggttg ttgcttaaaa aaaaagtcga 2100 

cgcggccgcg aa 2112 

<210> 12 
<211> 697 
<212> PRT 

<213> Artemisia annua L. 
<220> 

<223> Deduced amino acid sequence of a positive clone (amorphadiene 
synthase encoding gene) isolated from the cDNA library of 
induced A. annua 

<400> 12 

Asn Ser Arg Pro Arg Arg Gin lie Met Ser Leu Thr Glu Glu Lys Pro 

1 5 10 t 15 

lie Arg Pro lie Ala Asn Phe Pro Pro Ser lie Trp Gly Asp Gin Phe 

20 25 30 

Leu lie Tyr Gin Lys Gin Val Glu Gin Gly Val Glu Gin lie Val Asn 

35 40 45 

Asp Leu Lys Lys Glu Val Arg Gin Leu Leu Lys Glu Ala Leu Asp lie 

50 55 60 

Pro Met Lys His Ala Asn Leu Leu Lys Leu lie Asp Glu lie Gin Arg 
65 70 75 80 

Leu Gly lie Pro Tyr His Phe Glu Arg Glu lie Asp His Ala Leu Gin 

85 90 95 

Cys lie Tyr Glu Thr Tyr Gly Asp Asn Trp Asn Gly Asp Arg Ser Ser 

100 105 110 

Leu Trp Phe Arg Leu Met Arg Lys Gin Gly Tyr Tyr Val Thr Cys Asp 

115 120 125 

Val Phe Asn Asn Tyr Lys Asp Lys Asn Gly Ala Phe Lys Gin Ser Leu 

130 135 140 

Ala Asn Asp Val Glu Gly Leu Leu Glu Leu Tyr Glu Ala Thr Ser Met 
145 150 155 160 

Arg Val Pro Gly Glu lie lie Leu Glu Asp Ala Leu Gly Phe Thr Arg 

165 170 175 

Ser Arg Leu Ser lie Met Thr Lys Asp Ala Phe Ser Thr Asn Pro Ala 

180 185 190 

Leu Phe Thr Glu lie Gin Arg Ala Leu Lys Gin Pro Leu Trp Lys Arg 

195 200 205 

Leu Pro Arg lie Glu Ala Ala Gin Tyr lie Pro Phe Tyr Gin Gin Gin 

210 215 220 

Asp Ser His Asn Lys Thr Leu Leu Lys Leu Ala Lys Leu Glu Phe Asn 
225 230 235 240 

Leu Leu Gin Ser Leu His Lys Glu Glu Leu Ser His Val Cys Lys Trp 

245 250 255 

Trp Lys Ala Phe Asp lie Lys Lys Asn Ala Pro Cys Leu Arg Asp Arg 

260 265 270 

lie Val Glu Cys Tyr Phe Trp Gly Leu Gly Ser Gly Tyr Glu Pro Gin 

275 280 285 

Tyr Ser Arg Ala Arg Val Phe Phe Thr Lys Ala Val Ala Val lie Thr 

290 295 300 

Leu lie Asp Asp Thr Tyr Asp Ala Tyr Gly Thr Tyr Glu Glu Leu Lys 
305 310 315 320 

lie Phe Thr Glu Ala Val Glu Arg Trp Ser lie Thr Cys Leu Asp Thr 

325 330 335 

Leu Pro Glu Tyr Met Lys Pro lie Tyr Lys Leu Phe Met Asp Thr Tyr 



V 









340 










345 










350 






Thr 


Glu 


Met 


Glu 


Glu 


Phe 


Leu 


Ala 


Lys 


Glu 


Gly 


Arg 


Thr 


Asp 


Leu 


Phe 






355 










360 










365 








Asn 


Cys 


Gly 


Lys 


Glu 


Phe 


Val 


Lys 


Glu 


Phe 


Val 


Arg 


Asn 


Leu 


Met 


Val 




370 










375 










380 










Glu 


Ala 


Lys 


Trp 


Ala 


Asn 


Glu 


Gly 


His 


He 


Pro 


Thr 


Thr 


Glu 


Glu 


His 


385 










390 










395 










400 


Asp 


Pro 


Val 


Val 


He 


lie 


Thr 


Gly 


Gly 


Ala 


Asn 


Leu 


Leu 


Thr 


Thr 


Thr 










405 










410 










415 




Cys 


Tyr 


Leu 


Gly 


Met 


Ser 


Asp 


He 


Phe 


Thr 


Lys 


Glu 


Ser 


Val 


Glu 


Trp 








420 










425 










430 






Ala 


Val 


Ser 


Ala 


Pro 


Pro 


Leu 


Phe 


Arg 


Tyr 


Ser 


Gly 


lie 


Leu 


Gly 


Arg 






435 










440 










445 








Arg 


Leu 


Asn 


Asp 


Leu 


Met 


Thr 


His 


Lys 


Ala 


Glu 


Gin 


Glu 


Arg 


Lys 


His 




450 










4 55 










460 










Ser 


Ser 


Ser 


Ser 


Leu 


Glu 


Ser 


Tyr 


Met 


Lys 


Glu 


Tyr 


Asn 


Val 


Asn 


Glu 


465 










470 










475 










480 


Glu 


Tyr 


Ala 


Gin 


Thr 


Leu 


lie 


Tyr 


Lys 


Glu 


Val 


Glu 


Asp 


Val 


Trp 


Lys 










485 










490 










495 




Asp 


lie 


Asn 


Arg 


Glu 


Tyr 


Leu 


Thr 


Thr 


Lys 


Asn 


lie 


Pro 


Arg 


Pro 


Leu 








500 










505 










510 






Leu 


Met 


Ala 


Val 


He 


Tyr 


Leu 


Cys 


Gin 


Phe 


Leu 


Glu 


Val 


Gin 


Tyr 


Ala 






515 










520 










525 








Gly 


Lys 


Asp 


Asn 


Phe 


Thr 


Arg 


Met 


Gly 


Asp 


Glu 


Tyr 


Lys 


His 


Leu 


lie 




530 










535 










540 










Lys 


Ser 


Leu 


Leu 


Val 


Tyr 


Pro 


Met 


Ser 


lie 


Leu 


Pro 


lie 


Leu 


Arg 


Ala 


545 










550 










555 










560 


Pro 


He 


Asn 


Tyr 


He 


Glu 


Arg 


Val 


Asn 


Tyr 


Ala 


Arg 


Leu 


Tyr 


Gly 


Glu 










565 










570 










575 




Asn 


Phe 


Ser 


Ser 


Tyr 


Leu 


Val 


Phe 


Leu 


Ala 


Gly 


Asn 


Asn 


Lys 


Ser 


Asp 








580 










585 










590 






Ala 


Asn 


Cys 


He 


Glu 


Leu 


Tyr 


Ala 


Asp 


Ser 


Tyr 


Leu 


Lys 


Leu 


Leu 


Tyr 






595 










600 










605 








Asn 


Asn 


He 


Gin 


Gin 


Trp 


Tyr 


Tyr 


Thr 


Phe 


Thr 


Leu 


Tyr 


Lys 


Ser 


Lys 




610 










615 










620 










Ser 


Thr 


Leu 


Leu 


Leu 


Cys 


Asn 


lie 


Leu 


Val 


Leu 


Tyr 


Phe 


Ser 


Tyr 


Glu 


625 










630 










635 










640 


Ser 


Ala 


Tyr 


He 


His 


Tyr 


Thr 


Phe 


Met 


Gin 


Lys 


Thr 


Leu 


Val 


Asn 


Lys 










645 










650 










655 




Lys 


Ser 


He 


Ser 


Cys 


Thr 


Thr 


His 


lie 


Ala 


Arg 


lie 


Ser 


Val 


Cys 


Arg 








660 










665 










670 






Leu 


Tyr 


Phe 


Thr 


He 


Cys 


Tyr 


Leu 


Met 


Asn 


Met 


Phe 


His 


Val 


Val 


Val 






675 










680 










685 








Ala 


Lys 


Lys 


Ser 


Arg 


Arg 


Gly 


Arg 


Glu 

















690 695 



<210> 13 
<211> 1649 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleotide sequence of the amorphadiene synthase encoding gene, 
between start and stop codon, obtained by PCR with primers C 
and D 

<400> 13 

ccatggcact tacagaagaa aaacctattc gccccattgc caactttcct ccaagcattt 60 



ggggagatca gtttctcatc tatcaaaagc aagtagagca aggggtggaa cagatagtga 120 



atgatttaaa 


aaaagaagtg 


cggcaactac 


taaaagaagc 


tttggatatt 


cctatgaaac 


180 


atgccaattt 


qttqaaqctq 


attgatgaaa 


ttcaacgcct 


tggaataccg 


tatcactttg 


240 


aacqqqaqat 

ZJ zj zj zj 


tgatcatgca 


ttgcaatgta 


tttatgaaac 


atatggtgat 


aactggaatg 


300 


gtgaccgctc 


ttccttatgg 


ttccgtctta 


tgcgaaagca 


aggatattat 


gttacatgtg 


360 


atgttttcaa 


taactataaa 


gacaaaaatg 


gagcgttcaa 


gcaatcgtta 


gctaatgatg 


420 


ttgaaggttt 


gcttgagttg 


tacgaagcaa 


cttctatgag 


qqtacctqqq 


gagattatat 


480 


tagaagatgc 


tcttggtttt 


acacgatct c 


gtcttagcat 


tatgacaaaa 


gatgcttttt 


540 


ctacaaaccc 


cgctcttttt 


accgaaatac 


aacgggcact 


aaagcaaccc 


ctttggaaaa 


600 


ggttgccaag 


aatagaggcg 


gcgcagtaca 


ttcctttcta 


tcaacaacaa 


gattctcata 


660 


acaagacttt 


acttaaactt 


gctaagttag 


agttcaattt 


gcttcagtca 


ttgcacaagg 


720 


aagagctcag 


ccatgtgtgc 


aaatggtgga 


aagctttcga 


tatcaagaag 


aacgcacctt 


780 


gtttaagaga 


tagaattgtt 


gaatgctact 


tttggggact 


aggttcaggc 


tatgagccac 


840 


agtat tcccg 


ggctagagtt 


ttcttcacaa 


aagctgttgc 


tgttataact 


cttatagatg 


900 


acacttatga 


tgcgtatggt 


acttatgaag 


aacttaagat 


ctttactgaa 


gctgttgaaa 


960 


qqtqqtcaat 

ZJ ZJ ZJ ZJ 


tacatgctta 


gacacacttc 


cagaatacat 


gaaaccgata 


tacaaattat 


1020 


tcatggatac 


atacacagaa 


atggaagaat 


ttcttgcaaa 


qqaqqqaaqa 


acagatctat 


1080 


ttaactgcgg 


caaagaattt 


qtqaaaqaqt 

t3 3 J3 ZJ 


ttgttagaaa 


cctgatggtt 


gaagcaaaat 


1140 


qqqcaaat qa 

ZJ ZJ Z3 ^3 


gggacacat a 


ccaaccact g 


aagagcatga 


tccagttgta 


atcattactg 


1200 


qcqqtqctaa 

zj zj zj " 


cctgcttaca 


acaacttgtt 


at cttggcat 


gagtgatata 


ttcacaaaag 


1260 


agt ctgtcga 


atqqqctqt c 

^ zj zj zj 


tctgcacct c 


ctctttttag 


atactcaggt 


atacttggtc 


1320 


gacgcctaaa 


tgatctcatg 


acccacaagg 


ccgagcaaga 


aagaaaacat 


agttcatcga 


1380 


gccttgaaag 


ttatatgaag 


gaatataatg 


tcaatgagga 


gtatgcccaa 


accttgattt 


1440 


acaaggaagt 


agaagatgtg 


tggaaagata 


taaaccgaga 


gtacctcaca 


actaaaaaca 


1500 


ttccaaggcc 


gttattgatg 


gctgtgatct 


atttgtgcca 


gtttcttgaa 


gttcaatatg 


1560 


caggaaagga 


taacttcaca 


cgtatgggag 


acgaatacaa 


acatctcata 


aagtctctac 


1620 


tcgtttatcc 


tatgagtata 


tgaggatcc 








1649 



<210> 14 
<211> 549 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Deduced amino acid sequence of the amorphadiene synthase 

encoding gene, between start and stop codon, obtained by PCR 
with primers C and D 



<400> 14 

Thr Met Ala Leu 

Pro Pro Ser lie 
20 

Glu Gin Gly Val 
35 

Gin Leu Leu Lys 
50 

Leu Lys Leu lie 
65 

Glu Arg Glu lie 

Asp Asn Trp Asn 
100 

Lys Gin Gly Tyr 
115 

Lys Asn Gly Ala 
130 

Leu Glu Leu Tyr 
145 

Leu Glu Asp Ala 

Lys Asp Ala Phe 
180 

Ala Leu Lys Gin 
195 

Gin Tyr lie Pro 
210 

Leu Lys Leu Ala 
225 

Glu Glu Leu Ser 

Lys Asn Ala Pro 
260 

Gly Leu Gly Ser 
275 

Phe Thr Lys Ala 
290 

Ala Tyr Gly Thr 
305 

Arg Trp Ser He 

He Tyr Lys Leu 
340 

Ala Lys Glu Gly 
355 

Lys Glu Phe Val 
370 

Gly His He Pro 
385 

Gly Gly Ala Asn 

He Phe Thr Lys 
420 

Phe Arg Tyr Ser 
435 

His Lys Ala Glu 
450 

Tyr Met Lys Glu 



Thr Glu Glu Lys 
5 

Trp Gly Asp Gin 

Glu Gin He Val 
40 

Glu Ala Leu Asp 
55 

Asp Glu He Gin 
70 

Asp His Ala Leu 
85 

Gly Asp Arg Ser 

Tyr Val Thr Cys 
120 

Phe Lys Gin Ser 
135 

Glu Ala Thr Ser 
150 

Leu Gly Phe Thr 
165 

Ser Thr Asn Pro 

Pro Leu Trp Lys 
200 

Phe Tyr Gin Gin 
215 

Lys Leu Glu Phe 
230 

His Val Cys Lys 
245 

Cys Leu Arg Asp 

Gly Tyr Glu Pro 
280 

Val Ala Val He 
295 

Tyr Glu Glu Leu 
310 

Thr Cys Leu Asp 
325 

Phe Met Asp Thr 

Arg Thr Asp Leu 
360 

Arg Asn Leu Met 
375 

Thr Thr Glu Glu 
390 

Leu Leu Thr Thr 
405 

Glu Ser Val Glu 

Gly He Leu Gly 
440 

Gin Glu Arg Lys 
455 

Tyr Asn Val Asn 



Pro lie Arg Pro 
10 

Phe Leu He Tyr 
25 

Asn Asp Leu Lys 

He Pro Met Lys 
60 

Arg Leu Gly He 
75 

Gin Cys He Tyr 
90 

Ser Leu Trp Phe 
105 

Asp Val Phe Asn 

Leu Ala Asn Asp 
140 

Met Arg Val Pro 
155 

Arg Ser Arg Leu 
170 

Ala Leu Phe Thr 
185 

Arg Leu Pro Arg 

Gin Asp Ser His 
220 

Asn Leu Leu Gin 
235 

Trp Trp Lys Ala 
250 

Arg He Val Glu 
265 

Gin Tyr Ser Arg 

Thr Leu He Asp 
300 

Lys He Phe Thr 
315 

Thr Leu Pro Glu 
330 

Tyr Thr Glu Met 
345 

Phe Asn Cys Gly 

Val Glu Ala Lys 
380 

His Asp Pro Val 
395 

Thr Cys Tyr Leu 
410 

Trp Ala Val Ser 
425 

Arg Arg Leu Asn 

His Ser Ser Ser 
460 

Glu Glu Tyr Ala 



He Ala Asn Phe 
15 

Gin Lys Gin Val 
30 

Lys Glu Val Arg 
45 

His Ala Asn Leu 

Pro Tyr His Phe 
80 

Glu Thr Tyr Gly 
95 

Arg Leu Met Arg 
110 

Asn Tyr Lys Asp 
125 

Val Glu Gly Leu 

Gly Glu He lie 
160 

Ser He Met Thr 
175 

Glu He Gin Arg 
190 

He Glu Ala Ala 
205 

Asn Lys Thr Leu 

Ser Leu His Lys 
240 

Phe Asp lie Lys 
255 

Cys Tyr Phe Trp 
270 

Ala Arg Val Phe 
285 

Asp Thr Tyr Asp 

Glu Ala Val Glu 
320 

Tyr Met Lys Pro 
335 

Glu Glu Phe Leu 
350 

Lys Glu Phe Val 
365 

Trp Ala Asn Glu 

Val He He Thr 
400 

Gly Met Ser Asp 
415 

Ala Pro Pro Leu 
430 

Asp Leu Met Thr 
445 

Ser Leu Glu Ser 

Gin Thr Leu lie 



465 










470 










47 5 










Tyr 


Lys 


Glu 


Val 


Glu 


Asp 


Val 


Trp 


Lys 


Asp 


lie 


Asn 


Arg 


<jl u 


T\/T~ 






4 85 










h y u 










4 95 


Thr 


Thr 


Lys 


Asn 


lie 


Pro 


Arg 


Pro 


Leu 


Leu 


Mot 


Ala 


Val 


He 


Tvr 






500 




















510 




Cys 


Gin 


Phe 


Leu 


Glu 


Val 


Gin 


Tyr 


Ala 


Gly 


Lys 


Asp 


Asn 


Phe 


Thr 




515 










520 










525 






Met 


Gly 


Asp 


Glu 


Tyr 


Lys 


His 


Leu 


He 


Lys 


Ser 


Leu 


Leu 


Val 


Tyr 




530 










535 










540 








Met 


Ser 


He 


Gly 


Ser 






















545 































480 



